NMRDSP: An Accurate Prediction of Protein Shape Strings from NMR Chemical Shifts and Sequence Data
نویسندگان
چکیده
Shape string is structural sequence and is an extremely important structure representation of protein backbone conformations. Nuclear magnetic resonance chemical shifts give a strong correlation with the local protein structure, and are exploited to predict protein structures in conjunction with computational approaches. Here we demonstrate a novel approach, NMRDSP, which can accurately predict the protein shape string based on nuclear magnetic resonance chemical shifts and structural profiles obtained from sequence data. The NMRDSP uses six chemical shifts (HA, H, N, CA, CB and C) and eight elements of structure profiles as features, a non-redundant set (1,003 entries) as the training set, and a conditional random field as a classification algorithm. For an independent testing set (203 entries), we achieved an accuracy of 75.8% for S8 (the eight states accuracy) and 87.8% for S3 (the three states accuracy). This is higher than only using chemical shifts or sequence data, and confirms that the chemical shift and the structure profile are significant features for shape string prediction and their combination prominently improves the accuracy of the predictor. We have constructed the NMRDSP web server and believe it could be employed to provide a solid platform to predict other protein structures and functions. The NMRDSP web server is freely available at http://cal.tongji.edu.cn/NMRDSP/index.jsp.
منابع مشابه
HASH: a program to accurately predict protein Hα shifts from neighboring backbone shifts.
Chemical shifts provide not only peak identities for analyzing nuclear magnetic resonance (NMR) data, but also an important source of conformational information for studying protein structures. Current structural studies requiring H(α) chemical shifts suffer from the following limitations. (1) For large proteins, the H(α) chemical shifts can be difficult to assign using conventional NMR triple-...
متن کاملCS23D: a web server for rapid protein structure generation using NMR chemical shifts and sequence data
CS23D (chemical shift to 3D structure) is a web server for rapidly generating accurate 3D protein structures using only assigned nuclear magnetic resonance (NMR) chemical shifts and sequence data as input. Unlike conventional NMR methods, CS23D requires no NOE and/or J-coupling data to perform its calculations. CS23D accepts chemical shift files in either SHIFTY or BMRB formats, and produces a ...
متن کاملFast and accurate predictions of protein NMR chemical shifts from interatomic distances.
We present a method, CamShift, for the rapid and accurate prediction of NMR chemical shifts from protein structures. The calculations performed by CamShift are based on an approximate expression of the chemical shifts in terms of polynomial functions of interatomic distances. Since these functions are very fast to compute and readily differentiable, the CamShift approach can be utilized in stan...
متن کاملAccurate and automated classification of protein secondary structure with PsiCSI.
PsiCSI is a highly accurate and automated method of assigning secondary structure from NMR data, which is a useful intermediate step in the determination of tertiary structures. The method combines information from chemical shifts and protein sequence using three layers of neural networks. Training and testing was performed on a suite of 92 proteins (9437 residues) with known secondary and tert...
متن کاملAccurate calculation, prediction, and assignment of 3He NMR chemical shifts of helium-3-encapsulated fullerenes and fullerene derivatives.
Helium-3 NMR chemical shifts of various (3)He-encapsulated fullerenes ((3)He@C(n)()) and their derivatives have been calculated at the GIAO-B3LYP/3-21G and GIAO-HF/3-21G levels with AM1 and PM3 optimized structures. A good linear relationship between the computed (3)He NMR chemical shifts and the experimental data has been determined. Comparisons of the calculation methods of (3)He NMR chemical...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 8 شماره
صفحات -
تاریخ انتشار 2013